Keyword data informs and impacts every other SEO task, including finding content topics, on-page SEO, email outreach, and content promotion. While a wealth of companies provide granular keyword data, a birds-eye view of Google Searches is missing in the industry. This large-scale analysis aims to fill the gap.
As a data partner, DataForSEO provided the raw data for the analysis through their keyword database [https://dataforseo.com/databases]. The analysis was limited to US keyword data that accounted for around 306 keyword searches. Keyword monthly searchers with volume of 0 were removed from the data set. The data was queried from a Google BigQuery database.
To attain additional insights from the data, we enriched a random sample of around 2.5 million keywords with the Ahrefs Keyword Explorer. This includes the data on difficulty, return rate, clicks, region volume, and SERP features.
Below an overview of the data:
| Statistic | Value |
|---|---|
| Total number of searches | ~306 million |
| Total volume of searches | ~32 billion |
| Searches with missing volume | 0.51% |
| Mean search volume | 989 |
| Median search volume | 10 |
| Mean CPC | 0.61 |
(For how the volume was calculated, see appendix.)
These are the most popular searches, with volume based on ahref:
| Keyword | Volume |
|---|---|
| youtube | 0.546% |
| 0.530% | |
| amazon | 0.407% |
| gmail | 0.296% |
| 0.271% | |
| weather | 0.164% |
| yahoo | 0.161% |
| ebay | 0.161% |
| walmart | 0.145% |
| yahoo mail | 0.143% |
| netflix | 0.139% |
| google docs | 0.100% |
| translate | 0.098% |
| usps tracking | 0.093% |
| news | 0.091% |
| craigslist | 0.091% |
| fox news | 0.091% |
| cnn | 0.083% |
| calculator | 0.073% |
| hotmail | 0.064% |
| roblox | 0.063% |
| target | 0.063% |
| 0.057% | |
| msn | 0.057% |
| trump | 0.054% |
| 0.054% | |
| bank of america | 0.051% |
| new year | 0.051% |
| maps | 0.050% |
| nfl | 0.044% |
| ups tracking | 0.042% |
| 0.041% | |
| 0.041% | |
| espn | 0.038% |
| disney plus | 0.037% |
| etsy | 0.036% |
| usps | 0.035% |
| finance | 0.033% |
| aol | 0.029% |
| women’s world cup 2019 | 0.026% |
| nba | 0.024% |
| you | 0.023% |
| amazon prime video | 0.022% |
| internet speed test | 0.021% |
| bed bath and beyond | 0.021% |
| ikea | 0.020% |
| dow | 0.018% |
| food near me | 0.018% |
| united airlines | 0.018% |
| speedtest | 0.017% |
The search volume is concentrated among the top searches:
| Searches | Volume |
|---|---|
| Top 50 | 4.9% |
| Top 500 | 8.4% |
| Top 2000 | 12.2% |
| Top 10k | 18.8% |
If we divide the searches into categories based on how much volume they receive, this gives an overview:
These figures together show that low volume searches (individual searches with volume less than 100), constitute 91.8% of all searches, but only 3.3% of all volume. Or stated in another way, 91.8% of keywords get 1-100 volume a month. Whereas very high volume searches (individual searches with volume above 10,000) constitute only 0.2% of all searches, but have 75.2% of all volume.
The top searches have high volume compared to the rest, and the search volume level quickly levels off. The picture looks slightly different depending on whether we use the values from DataforSEO:
Or ahref:
If a misspelling is recognized, a so-called spell type is suggested. There are three types of spell shown in the table below. ~1.4% of searches have a spell type, but those that do have tend to have high volume.
14.1% of searches are in the form of a question. “how” is the most common question word.
A list of stopwords was qcquired from here: https://meta.wikimedia.org/wiki/Stop_word_list/google_stop_word_list#English
“how” and “the” are the most common stopwords, which are present in 6-8% of searches.
A colorful version:
The most searched queries have length 5-10 characters, and falls continuously for search queries longer or shorter than that.
The mean keyword length in characters is 11.2, and the median is 8.5.
Most searches consist of a single word, with volume quickly falling as the number of words increase.
Search terms that are 1-3 words in length get ~10x as much volume on average as search terms that are 5+ words in length.
Internet & Telecom is the keyword category with the highest mean volume.
Arts & Entertainment, Internet & Telecom, and News, Media & Publications have the highest total volume.
Finance has the highest mean cost per click.
The average CPC for all searches is 0.61
Description of keyword difficulty from ahref:
Keyword Difficulty (KD) effectively tells you the average height (backlink profile strength) of the other players in the game (top-ranking pages). The higher the number, the more height (backlinks) you’ll likely to need to stand a chance of ranking.
As volume increases, the difficulty increases.
The slope of the linear regression line is such that for each doubling of the volume, the difficulty increases by 1.63. For example, as the volume goes from 100 to 3200 (6 doublings), the difficulty increases by roughly 1.63 * 6 ~= 10.
Higher difficulty also means higher CPC on average. Note that the Y axis is logarithmic, so a small move on the y axis is a large increase in value.
An alternative visualization of the same data by grouped category in a boxplot:
Search Engine Results Pages (SERPs) are the pages that Google and other search engines show in response to a user’s search query.
97.6% of the SERPs contain at least one SERP feature.
The figure below gives an overview of how often the different SERP features are shown. (Note there are (at least) two additional SERP feature types (Knowledge Panel and Videos), for which the sample size is too small to include.)
The SERP features shown in the most searches are Image pack and People also ask:
The knowledge card has a huge effect in reducing the clicks-per-search, while the other SERP features have limited effect. Searches with the Shopping results SERP feature have higher cps on average.
Easy keywords have fewer SERP features.
Thumbnail & Top stories is the most common SERP feature pairing.
Searches without SERP features tend to be low volume.
Searches with more SERP features have higher mean difficulty.
Return Rate shows how often the same person searches for a given keyword over a 30-day period. A return rate of 1 means that people typically don’t search for that keyword again within 30 days.
We arbitrarily grouped return rate into three groups, No significant return rate (< 2), medium (2-10) and high (10+). Then we look at searches that have similar volume, but different return rate.
As the return rate increases, the number of clicks increases and the cost per click increases.
| Return rate | Mean return rate | Mean CPC | Mean clicks | Mean difficulty |
|---|---|---|---|---|
| High | 17.764996 | 0.9577433 | 71423.308 | 18.39984 |
| Medium | 3.159245 | 0.8680594 | 24901.016 | 16.29606 |
| Low | 1.234423 | 0.6867880 | 2121.582 | 16.77932 |
There is search data from 5 English-speaking countries.
Of those, US and UK have the highest search volume per person. Americans use google 38% more than UK, 108% more than Canada and 90% more than Australia, when adjusted for population.
US of course has much higher total search volume overall:
US has significantly higher cost per click on average
The following is based on analysis with ahref.
International searches have overall higher volume
| region | volume |
|---|---|
| US | 33% |
| International | 67% |
Internationally there are more searches with very low volume, while US has more searches with medium volume.
There is not a large difference in the number of searches with very high volume. However, the total volume of these searches is a lot higher internationally
Searches that have high US volume tend to have high international volume, and vice versa. But there are some exceptions.
A version showing data points binned to hex tiles showing counts: